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Sir: 



1 . I, Caius Rommens, have been employed as Director R&D at J. R. Simplot 
Company (Boise, Idaho), since 2000. 

2. I have doctorate in Plant Biology from the Free University of Amsterdam. I 
have worked extensively in agricultural genetics throughout my professional career. I attach 
a Curriculum Vitae evidencing my research in this field. See Exhibit 1 . I authored the peer- 
reviewed article Rommens et aL, "Plant-derived transfer DNAs" Plant Physiol 1 39: 1338- 
49, 2005 (Exhibit 2), for example, which describes our presently claimed invention. 

3. In this regard, I am the project director and co-inventor of the present 
application, USSN 10/607,538, which discloses and claims key Simplot technologies. Our 
"all-native" green biotechnology approach to genetic engineering of plants is novel and is 
generating much excitement and interest in the agricultural community. Our goal, which is 
evidenced throughout our application, is to transform plants without necessarily incorporating 
bacterial or viral or non-plant DNA into the recipient plant's genome. 

4. We have successfully achieved this goal as evidenced by the specification and 
examples disclosed in our application. Thus, we prove, by way of our originally-filed 
specification, that it is possible to transform a plant with a desirable gene or polynucleotide 
without necessarily incorporating any "foreign" DNA into the plant genome. This is a major 
breakthrough on many different levels, especially those dominating the agricultural and 
genetic engineering communities. The purpose of my declaration is to further explain and 
clarify our claimed invention, particularly with respect to present claim 3 under examination. 
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5. In this respect, I have read the Office Action dated May 23, 2006, and 
understand the Office's position regarding enablement as follows: 

[G]iven the claim breadth, unpredictability, and lack of 
guidance . . . undue experimentation would have been 
required ... to identify and isolate a multitude of non- 
exemplified P-DNAs or "transfer-DNAs" of a multitude of 
lengths and sequences, from a multitude of plant species 
and genomic regions [as well as to functionally] evaluate 
said non-exemplified sequences (page 13, lines 4-13). 

I offer the following commentary and evidence to respectfully rebut this conclusion. 
Specifically, I clarify how the application provides explicit guidance for identifying and 
isolating border-like sequences, and how these border-like sequences can be readily tested to 
determine their respective ability to support plant transformation. I will explain and prove 
that the methods disclosed in the application most certainly do yield predictable results . 

6. Two particularly interesting methodological aspects of our invention, which 
are extensively disclosed in the present Precise Breeding application, include (1) the 
identification and isolation of plant-derived sequences that resemble, but are not identical 
to, the nucleotide sequences of Agrobacterium T-DNA borders, and (2) the use of such 
border-like sequence for plant transformation . 

7. It is clear from the studies disclosed in our application and our subsequent use 
of those disclosed methods, that predictability is not a questionable issue . As I elaborate in 
this declaration, our disclosed methods for identifying and isolating sequences that conform 
to nucleotide composition requirements of transformation borders is predictable. Secondly, 
our functional assay is easy to perform and does not involve any undue experimentation. 
Simply put, our disclosed functional assay predictably identifies which border-like sequence 
is effective in transforming a plant. ^ 

8. A valuable outcome of our extensive and accurate analyses of the nucleotide 
sequences that we isolated, identified, and tested for functional activity led us to define a P- 
DNA border-like sequence by all of the following criteria: 

(i) it consists of 25 nucleotides and displays between 52 and 96% identity to a 
functional Agrobacterium T-DNA border; 

(ii) it can (a) be identified by searching databases for genomic and/or cDNA 
sequences of plants with any query sequence that complies to the Agrobacterium T-DNA 
border motif YGRYAGGATATATWSNVBKGTAAWY, or (b) be isolated by employing 
polymerase chain reactions using primers that anneal to T-DNA border sequences; 

(iii) it enables the transformation of a plant cell with DNA from a plasmid that 
contains the P-DNA border-like sequence. 

I discuss this enabling disclosure in more detail below. 
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9. Our specification discloses particular methods for identifying and isolating 
border-like sequences. For example, we relate that one may (1) search DNA databases and 
(2) employ the polymerase chain reaction (PCR) to identify sequences and actual DNA 
molecules that satisfy our inventive "border-like" criteria. See the following paragraphs. 

10. With respect to searching DNA databases for border-like sequences, the 
specification provides: 

(A) Examples of databases that can be searched for sequences that share identity 
with a non-T-DNA border sequence and/or which meet the requirements of our border-like 
consensus sequence (see paragraph 326), e.g., (i) The National Center For Biotechnology 
Information, (ii) The Institute for Genomic Research, and (iii) SANGER; and 

(B) Search tools for performing searches, e.g., (i) Motif Alignment and Search Tool 
(Bailey and Gribskov, Bioinformatics 14: 48-54, 1998), and (ii) advanced BLASTN ("penalty 
for nucleotide mismatch" = -1; "expect" = 10 5 ; Altschul et al., Nucleic Acids Res 25: 3389- 
3402, 1997) (see paragraph 327). See, for example, Exhibit 3. 

1 1 . With respect to using PCR to physically amplify and isolate border-like 
sequences, the specification provides "one-step" and "two-step" amplification protocols. 

(A) One-Step PCR border-like DNA isolation method 

(a) The present specification describes how polynucleotides that comprise border- 
like sequences can be amplified from plant DNA using PCR primers that are designed to 
anneal to border or border-like sequences. See paragraph 320 of the specification. 

(b) The common term "anneal" is readily understood by any molecular biologist. 
An annealing primer may represent either the exact inverse complement of a target sequence 
or it may be somewhat dissimilar in sequence to the target. Nonetheless, whether the primer 
is or is not an exact complement of the target, it should be able to hybridize with its target 
sequence at a set "annealing" temperature. Paragraph 47 explains that primers may be 
degenerate. 

(c) Specific examples of primers are shown in paragraph 165 
(CAGGATATATNNNNNKGTAAAC), paragraph 321 

(TGRCAGGATATATNVNDNTGTAAAC) and paragraph 330 ( YGRC AGG AT AT AT) . A 
variety of such primers that were designed to anneal to border sequences were used to isolate 
border and P-DNA sequences from potato, e.g., as depicted in SEQ ID NO. 1. See paragraph 
320. 

(B) Two-Step PCR border-like DNA isolation method 

Plant DNA fragments with an unknown sequence were ligated to a known sequence. 
By using the resulting DNA as a template, it was possible to use (i) a single primer designed 
to anneal to a border sequence (the border primer) and (ii) a primer that anneals to the known 
sequence (the anchor primer) to perform a two-step PCR. See paragraph 321-330. An 
example of the method also is shown in Exhibit 4. 
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Step 1 entails (a) digesting total DNA with a restriction enzyme such as SauIIIA; 
(b) ligating the restriction fragments to a known polynucleotide, such as to the 192 bp 
BamHI-EcoRV fragment of pBR322; (c) performing a first PCR with a border primer 
(e.g., YGRCAGGATATATNNNNNKGTAAAC) and an anchor primer (e.g., 
GACCACACCCGTCCTGTG); and (d) ligating a PCR product to a second known 
polynucleotide such as pGEM-T; 

Step 2 entials (a) using the ligated DNA for a second PCR with a border primer and 
second anchor primer such as SP6 or T7; (b) cloning and sequencing the products; (c) using 
the sequenced DNAs to design four primers, and performing an inverse PCR on plant DNA 
that was digested with SauIIIA and self-ligated; and (d) cloning and sequencing the resultant 
amplified products to obtain actual border-like sequences. 

12. These disclosed database-searching and PCR-based techniques ultimately 
produce a collection of plant-specific DNA sequences that can serve as potential T-DNA 
border-like elements. In this regard, we make clear in our application tha a functional plant- 
specific border-like sequence does not have a nucleotide sequence that is identical to a 
conventional T-DNA border sequence, e.g., a left border or a right border. Accordingly, our 
reproducible methods help to uncover genetic elements that are useful in transferring one 
polynucleotide to another, e.g., from an Agrobacterium vector to a plant genome. 

13. We teach in our application how to test the functionality of any one of those 
DNA sequences. See paragraph 345, as well as paragraph 20 below. Collectively, therefore, 
our methods provide DNA sequences that are (a) from a plant and (b) functional in 
transferring polynucleotides into, for instance, a plant genome. These DNA sequences 
therefore can be used to replace conventional T-DNA left and right border sequences in any 
given transfer cassette. 

14. The functional DNA sequences can be compared with each other to determine 
whether there exists any commonality in sequence or nucleotide residues that can be used to 
characterize the sequences as a whole. This is precisely what we did in the present 
application. We analyzed the nucleotide sequences of the isolated genetic elements and 
derived a border motif nucleotide sequence. The border motif simply provides guidance 
concerning those nucleotide residues and features that we typically identified from our 
functional border-like sequence analysis. The consensus sequence does not signify any 
particular sequence or sequences that a potential DNA fragment must comport with in order 
to be characterized as a plant border-like sequence. 

15. From our sequence searches of the disclosed databases and from our technical 
expertise in amplifying borders from plant genomic DNA, therefore, we identified and 
collated numerous border-like sequences. Our analysis of various border sequence 
comparisons (see paragraph 262) helped us to create the T-DNA border motif disclosed in 
Table 2 of the application, namely YGRYAGGATATATWSNVBKGTAAWY . where Y = 
C or T, , R = A or G, W= A or T, S= C or G, K = G or T, V = A, C, or G, B = C, G, or T, and 
N= any nucleotide. 

16. One particular sequence of this border motif is 

CG AC AGG AT AT ATTTGTTGGT AATC . This sequence was used to conduct an electronic 
BLAST search of the NCBI GenBank database with this sequence and identified sequences 
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that shared sequence identity. From those comparisons, we identified plants, e.g., pepper, 
tomato, and maize, which actually contained corresponding endogenous sequences. 

17. Within that border motif sequence, we identified a central core region 
ANGATNTATN6GT that can be important in predicting functional activity in mediating 
plant transformation. See SEQ ID NO. 93 and paragraph 219. 

18. Hence, we (1) created, from our analysis of numerous border sequences, a 
border motif sequence; (2) deduced polynucleotide sequences conforming to that consensus; 
(3) identified endogenous and native sequences that shared significant sequence identity with 
our deduced sequence, (4) designed PCR primers based on consensus sequences that can be 
used to isolate functional border-like sequences for any plant species, and (5) identified a 
central core region that often represents a good predictor for functional activity of the native 
sequences. 

1 9. In fact, by way of these disclosed methods, we successfully identified forty- 
one new border-like sequences . See Rommens et aL supra. 

20. In addition to those border-like sequence isolation and identification methods, 
we also disclosed in our specification a method to test the functionality of newly-identified 
border-like sequences. We performed this exact method to demonstrate that the potato P- 
DNA of SEQ ID NO. 1 could successfully transform tobacco. See paragraph 345. To 
summarize, our specification teaches the following functional assay : 

(a) insert the border-like sequence and a neomycin phosphotransferase (nptIT) gene 
into the linearized DNA of a plasmid that lacks T-DNA border sequences, such as pSIMlOO- 
OD-IPT; and then introduce that plasmid into an Agrobacterium strain. See paragraphs 31, 
341, and 344. 

(b) infect an explant of, e.g., tobacco, with the Agrobacterium. See paragraph 345. 

(c) identify the subsequent formation of kanamycin resistant calli. Explants that 
contain the nptll gene will be kanamycin resistant and therefore identifiable as successful 
transformation events. The number of individual calli that are identified serves as a general 
indicator of the transformation efficiency of the border-like element. See paragraph 345. 

2 1 . We also used the traditional tobacco model to test the transformation 
capabilities of numerous border-like sequences that we had identified and isolated from 
Arabidopsis, potato, tomato, barrel medic, alfalfa, barley, canola, corn, wheat, and rice. 
We have isolated over three dozen border sequences (1 from Arabidopsis, 6 from potato, 17 
from tomato, 2 from barrel medic, 1 from Alfalfa, 3 from barley, 2 from Brassica, 1 corn, 1 
wheat, and 3 from rice). None of these sequences is identical to the Agrobacterium T-DNA 
border consensus. On the other hand, the P-DNA border-like sequences comprise a sequence 
that is highly homologous or identical to the core consensus sequence ANGATNTATN 6 GT. 
Please see Exhibit 5. The disclosed consensus sequences provide guidance concerning what 
constitutes a functional border-like sequence. 

22. Our disclosed identification and functionality methodologies, and subsequent 
corresponding results, also informed us that an efficacious border from one plant species is 
very likely to be effective at transforming another plant species. For instance, we determined 
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that a pSIMlOO-OD-IPT-derived plasmid with the alfalfa border-like 
CGGCAGGATGTATACAGAGGTATAC yielded transformation frequencies 
in alfalfa and 168% in tobacco if compared to a conventional T-DNA 
efficacy of that alfalfa border-like sequence is effective in the transforjnation 
species. Similarly, the tomato border-like sequence 
GGCCAGGATATATTTGTTGGTAATG supported a transformation 
100% in tobacco, tomato, and potato. Transformation methods for alfklfa 
Agrobacterium-based, and referred to in paragraph 25 1 . The Agrobac 
disclosed in the application specification were used to transform plant! 
and tomato border-like sequence carried on modified pSIM 100-OD- 
paragraphs 31, 341 and 344). 



Sequence 

of about 127% 
/ector control. The 
of other plant 

frequency of about 

and tomato were 
erium methods 
with the above alfalfa 
vectors. (See 



23. With respect to co-transfer, I appreciate that a co 
employs two T-DNA vectors can yield different transformation results 
DNA/P-DNA vector combination. Those different results, however, 
predictability of those alternative co-transfer combinations . That 



transifcrmation event that 
compared to a T- 
not negate the 



do l 



DNA/P-DNA co-transformation event may be "ten-fold" less effective 
DNA combination at integrating a desired polynucleotide (paragraph 
DNA/P-DNA arrangement still works. Indeed, we teach how to improve 
efficiency by selectively enhancing desired integration events. See 



j, even though a T- 
than a T-DNA/T- 
85 and 389), the T- 
transformation 
es 10-14. 



ExampL 



ex< mplified < 



24. It is clear, therefore, that the disclosed methods, the 
and the extensive guidance provided by our specification for transfornjing plants 
identified border-like native polynucleotides, are predictable and succ< ssful 
identifying border-like sequences from plant genomes works. The me hod 
function of those border-like sequences works. The methods do not burden 
with any undue experimentation. Not only do the methods work, and 
but, iinportantly, they are predictable. 



embodiments, 
with newly- 
The method for 
for testing the 
the skilled person 
ire easy to perform, 



25. Accordingly, contrary to the Office's position, we do ic 
methods to identify and isolate a multitude of P-DNA border sequence s. 
also do in feet functionally evaluate those exemplified border-like 
are reproducible and predictably yield DNA fragments that can be rea< 
functional ability to act as a border-like genetic element. 



fact provide the 
Furthermore, we 

Our methods 
tested for their 



seqt ences, 



( ily 



26. I hereby declare that all statements made herein of my 
and that all statements made on information and belief are believed to 
these statements were made with the knowledge that willful false statements 
made are punishable by fine or imprisonment, or both, under Section 
United States Code, and that such willful false statements may jeopard L 
application or any patent issued thereon. 



dwn knowledge are true 
)e true; and further that 
and the like so 
of Title 18 of the 
ze the validity of the 
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EXHIBIT 1 
DR. ROMMENS' CURRICULUM VITAE 



Research Experience: 

• Director Plant Sciences at J.R. Simplot Company, ongoing from September 2000. 

o Developed new methods for plant transformation and gene silencing (see, for instance, US 

Patent Applications 2003/022121 3 Al, 2005/0034 188A1, and 2006/0 156428A1) 
o Developed new strategies for crop improvement 

• Team Leader Disease Control at Monsanto Company, from March 1995 until August 2000 

o Developed strategies for enhanced disease resistance in agriculturally-important crops (see, 

for instance, US Patents 6544733, 6506962, and 7030293) 
o Developed strategies for improved plant transformation (see, for instance, US Patent 

Application 2004/01 3393 8A1). 

• Postdoctoral Fellow at the University of California from January 1993 until February 1995. 

o Isolated various disease resistance genes (see US Patent 62455 10) 

• PhD Student at the Free University Amsterdam from Apr 1988 until December 1992. 

o PhD thesis of December 14 th , 1992, entitled "Transposition of the maize Activator element in 
tomato" 

Partial List of Publications: 

Rommens CM (2006) Kanamycin resistance in plants: an unexpected trait controlled by a potentially 
multifaceted gene. Trends Plant Sci Jun 13; 1 1: 317-319. 

Yan H, Chretien R, Ye J, Rommens CM (2006) New Construct Approaches for Efficient Gene 
Silencing in Plants. Plant Physiol Jun 9; 141: 1508-1518. 

Rommens CM, Bougri O, Yan H, Humara JM, Owen J, Swords K, Ye J (2005) Plant-derived transfer 
DNAs. Plant Physiol 139: 1338-1349. 

Rommens CM (2004) All-native DNA transformation: a new approach to plant genetic engineering. 
Trends Plant Sci 9 :457-464. 

Rommens CM, Humara JM, Ye J, Yan H, Richael C, Zhang L, Perry R, Swords K (2004) Crop 
improvement through modification of the plant's own genome. Plant Physiol 135: 421-3 1. 

Rommens CM, Kishore GM (2000) Exploiting the full potential of disease-resistance genes for 
agricultural use. Curr Opin Biotechnol 11: 120-125. 

Gao AG, Hakimi SM, Mittanck CA, Wu Y, Woerner BM, Stark DM, Shah DM, Liang J, Rommens 
CM (2000) Fungal pathogen protection in potato by expression of a plant defensin peptide. Nat Biotechnol 18: 
1307-1310. 

Abad MS, Hakimi SM, Kaniewski WK, Rommens CM, Shulaev V, Lam E, Shah DM (1997) 
Characterization of acquired resistance in lesion-mimic transgenic potato expressing bacterio-opsin, Mol Plant 
Microbe Interact 10: 635-645. 

Salmeron JM, Oldroyd GE, Rommens CM, Scofield SR, Kim HS, Lavelle DT, Dahlbeck D, 
Staskawicz BJ (1996) Tomato Prf is a member of the leucine-rich repeat class of plant disease resistance genes 
and lies embedded within the Pto kinase gene cluster. Cell 86: 123-133. 

Rommens CM, Salmeron JM, Oldroyd GE, Staskawicz BJ (1995) Intergeneric transfer and functional 
expression of the tomato disease resistance gene Pto. Plant Cell 7: 1537-1544. 
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Rom mens CM, Salmeron JM, Baulcombe DC, Staskawicz BJ (1995) Use of a gene expression system 
based on potato virus X to rapidly identify and characterize a tomato Pto homolog that controls fenthion 
sensitivity. Plant Cell 7: 249-57. 

Rommens CM, van Haaren MJ, Nijkamp HJ, Hille J (1993) Differential repair of excision gaps 
generated by transposable elements of the 'Ac family 1 . Bioessays 15: 507-12. 

Rommens CM, Munyikwa TR, Overduin B, Nijkamp HJ, Hille J (1993) Transposition pattern of a 
modified Ds element in tomato. Plant Mol Biol 21 1 109-1 1 19. 

Rudenko GN, Rommens CM, Nijkamp HJ, Hille J (1993) Supported PCR: an efficient procedure to 
amplify sequences flanking a known DNA segment. Plant Mol Biol 21 : 723-728. 

Rommens CM, Rudenko GN, Dijkwel PP, van Haaren MJ, Ouwerkerk PB, Blok KM, Nijkamp HJ, 
Hille J (1992) Characterization of the Ac/Ds behaviour in transgenic tomato plants using plasmid rescue. Plant 
Mol Biol 20: 61-70. 

Rommens CM, van Haaren MJ, Buchel AS, Mol JN, van Tunen AJ, Nijkamp HJ, Hille J (1992) 
Transactivation of Ds by Ac-transposase gene fusions in tobacco. Mol Gen Genet 23 1 : 433-441 . 

Rommens CM, van der Biezen EA, Ouwerkerk PB, Nijkamp HJ, Hille J (1991) Ac-induced disruption 
of the double Ds structure in tomato. Mol Gen Genet 228: 453-458. 

Haring MA, Rommens CM, Nijkamp HJ, Hille J (1991) The use of transgenic plants to understand 
transposition mechanisms and to develop transposon tagging strategies. Plant Mol Biol 16: 449-461. Review. 

Haring MA, Gao J, Volbeda T, Rommens CM, Nijkamp HJ, Hille J (1989) A comparative study of 
Tam3 and Ac transposition in transgenic tobacco and petunia plants. Plant Mol Biol 13:1 89-20 1 . 
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Exhibit 2 



"Plant-derived transfer DNA's", Plant Physiol. 139:1338-49, 2005. 
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Plant-Derived Transfer DNAs 

Caius M. Rommens*, Oleg Bougri, Hua Yan, Jaime M. Humara, Joanna Owen, 
Kathy Swords, and Jingsong Ye 

J.R. Simplot Company, Simplot Plant Sciences, Boise, Idaho 83706 

The transfer of DNA from Agrobacterium to plant cell nuclei is initiated by a cleavage reaction within the 25-bp right border of 
Ti plasmids. In an effort to develop all-native DNA transformation vectors, 50 putative right border alternatives were identified 
in both plant expressed sequence tags and genomic DNA. Efficacy tests in a tobacco (Nicotiarm tabacurh) model system 
demonstrated that 14 of these elements displayed at least 50% of the activity of conventional Agrobacterium transfer DNA 
borders. Four of the most effective plant-derived right border alternatives were found to be associated with intron-exon 
junctions. Additional elements were embedded within introns, exons, untranslated trailers, and intergenic DNA. Based on the 
identification of a single right border alternative in Arabidopsis anH three irt rice {Oryza saliva), the occurrence of this motif was 
estimated at a frequency of at least 0.8 X 10 ~ 8 . Modification of plasmid DNA sequences flanking the alternative borders 
demonstrated that both upstream and downstream sequences play an important role in initiating DNA transfer. Optimal DNA 
transfer required the elements to be preceded by pyrimidine residues interspaced by AC-rich trinucleotides. Alteration of this 
organization lowered transformation frequencies by 46% to 93%. Despite their weaker resemblance with left borders, right 
border alternatives also functioned effectively in terminating DNA transfer, if both associated with an upstream A[C/T]T[C/ 
G]A[A/T]TIG/T][C/T][G/T][C/G]A[C/T][C/T][A/T1 domain and tightly linked cytosine clusters at their junctions with 
downstream DNA. New insights in border region requirements were used to construct an all-native alfalfa (Medicago sativa) 
transfer DNA vector that can be used for the production of intragenic plants. 



Agrobacterium-mediated plant transformation is 
based on the transfer of single-stranded DNA from 
Agrobacterium plasmids to plant cell nuclei. DNA 
transfer is initiated by virD2-mediated cleavage within 
the 25-bp right border sequence (Wang et al., 1984; 
Scheiffele et al., 1995). The Agrobacterium plasmids 
pTiC58 and pRi8196 contain additional sequences in 
the vicinity of the right border that enhance cleavage 
(van Haaren et al., 1987; Hansen et al., 1992). Upon 
initial cleavage, virD2 covalently binds to the 5' side, 
and the DNA unwinds toward the left border where 
a second cleavage reaction may occur. The released 
single-stranded DNA is coated with virE2 and pro- 
cessed for transfer via type IV-type secretion (Lessl 
and Lanka, 1994; Zupan and Zambryski, 1995). In the 
plant cell, nuclear localization signals of virD2 target 
the transfer DNA (T-DNA) to the nucleus for sub- 
sequent genomic integration (Shurvinton et al., 1992; 
Ziemienowicz et al., 2001). Because left borders often 
fail to function as sites for secondary cleavage, trans- 
formed plant cells frequently contain T-DNAs that are 
still attached to plasmid backbone sequences (Kononov 
et al., 1997). Furthermore, inadvertent right border 
activity of left borders can also result in the trans- 
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formation of plant cells that contain backbone DNA 
(Huang et al., 2004). 

Agrobacterium-derived right border regions have 
been used extensively for the transformation of plants 
with foreign DNA. Resulting transgenic plants often 
displayed new and agronomically important traits 
such as herbicide and insect tolerance. However, the 
presence of foreign DNA in food crops is often per- 
ceived as undesirable, and the application of genetic 
engineering has therefore been limited to a small 
number of crops that are destined for feed, oil, fibers, 
and processed ingredients. In contrast, products closer 
to the table, such as fruits and vegetables, have been 
hindered in their transgenic development (Rommens, 
2004). Public concerns were addressed through a pro- 
posal to diversify genetically modified crops based on 
the genetic distance between the source of new genetic 
material and target organism (Nielsen, 2003). Accord- 
ing to this proposal, the introduction of foreign DNA 
creates "transgenic 77 plants, whereas "xenogenic" plants 
result from the insertion of laboratory-designed DNA 
for which no naturally evolved genetic counterpart 
can be found or expected. Some of these two groups 
of plants deviate substantially from what has been 
achieved through conventional breeding. In contrast, 
rearrangements of genomic material from within the 
same sexual compatibility group would create "in- 
tragenic" plants. Such modifications would often alter 
traits in a similar but more efficient and precise man- 
ner than that of conventional plant breeding. 

One requirement for all-native DNA transformation 
is the replacement of Agrobacterium T-DNAs by 
plant-derived transfer DNA (P-DNA) sequences 
(Rommens, 2004). The concept of P-DNA-mediated 
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transformation has previously been demonstrated in 
potato (Solatium tuberosum). A 400-bp potato P-DNA 
delineated by regions that share homology with the 
left border of nopaline strains and the right border of 
octopine strains was effectively transferred from Agro- 
bacterium to plant cells (Rommehs et al., 2004). The 
potato P-DNA was subsequently used to introduce 
a silencing construct for a tuber-specific polyphenol 
oxidase gene into potato. Resulting intragenic plants 
displayed tolerance against black spot bruise sensitiv- 
ity in impacted tubers. 

Design of optimal P-DNA vectors requires a thor- 
ough understanding of the organization of Agrobac- 
terium T-DNA border regions. Here, we isolated a 
variety of right border alternative sequences from 
plants, determined their original genomic context, 
and tested the efficacy of these elements in mediating 
DNA transfer. We also studied the role of flanking 
DNA, and identified sequences that support the initi- 
ation of DNA transfer. Furthermore, right border 
alternatives outperformed Agrobacterium left borders 
if placed between AT-rich DNA and cytosine clusters 
(C-clusters). Our new understanding of sequences 
involved in DNA transfer makes it possible to effec- 
tively identify and utilize P-DNAs. 



RESULTS 

Right Border Sequence Requirements 

Agrobacterium right borders function in the initia- 
tion of DNA transfer. As shown in Figure 1A, these 
25-bp sequences (Rb01-Rb07) share a highly conserved 
13-bp 5'-ATATATCCTG-[C/T]-CA motif precededjby 
the more degenerate 12-bp consensus S'-fA/C/G]- 
[A/T]-[A/T]-[G/T]-AC-[A/C/T]-N-[C/G/T]-[A/C/ 
G]-[A/C/G]-N (CON1). The sequence plasticity of 
right border alternatives was studied by testing syn- 
thetic elements containing single mismatches to CON1 
(Sy01-Syl3) for their ability to mediate plant trans- 
formation. The elements, flanked by the 109-bp Agro- 
bacterium pTil5955 sequence upstream from Rb02, 
were inserted into a plasmid containing an expression 
cassette for the neomycin phosphotransferase (nptll) 
selectable marker gene. Agrobacterium strains carry- 
ing the resulting test plasmids were used to infect 
tobacco (Nicotiana tabacum) explants. Two weeks after 
infection, the average numbers of calli per explant 
were compared to those produced with control plas- 
mids containing RbOl or. CtOl, whereby 15.3 ± 0.5 
calli/explant = 100% transformation efficiency. As 
shown in Figure IB, all synthetic right border alter- 
natives had maintained the ability to mediate DNA 
transfer. However, base substitutions C6A, A13C, 
C19G, C20G, and T21A of Sy03, Sy07, Syll, Syl2, 
and Syl3, respectively, lowered transformation fre- 
quencies more than 5-fold, implying important nega- 
tive effects of these specific modifications on the ability 
of right border alternatives to initiate DNA transfer. 
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Right border alternative sequence requirements 
were further determined by testing the efficacy of plant 
sequences that either match or resemble CON1 (Fig. 
1C). In addition to the previously characterized ele- 
ment of the potato P-DNA (Rommens et al., 2004), 
designated here as StOl, a lar ge number of new putative 
ri^ht border alternatives was either amplified from 
genomes or identified in publicly available DNA data- 
bases (see ''Materials and Methods' 7 ). Among these se- 
quences, only the Arabidopsis AtOl fully matched the 
Agrobacterium right border consensus. However, this 
element displayed a lower activity (65%) than the con- 
trol borders, suggesting that the guanine base at posi- 
tion +4 (G4) of AtOl and poorly characterized right 
borders such as Rb03 (Jouanin et al., 1986) is not as ef- 
fective as T4 of the more commonly used RbOl and Rb02. 



Despite the pr 



•of one to three mismatches, 13 



right border alternatives from potato (St02, St03, and 
St04), tomato (Lycopersicon esculentum; LeOl, Le02, and 
Le03), pepper (Capsicum annuum; CaOl), alfalfa (Med- 
icago sativa; MsOl), barley (Hordeum vulgare; HvOl), rice 
(Oryza sativa: OsOl, Os02, and Os03), and wheat 
(Triticum aestivum; TaOl) displayed between 50% and 
more than 100% activity compared to the RbOl and 
CtOl control sequences. The functionality of these 
novel sequences indicates that Agrobacterium did 
not exploit the full potential of border sequence 
variation. In contrast to the highly effective right 
border alternatives, 36 additional elements from a va- 
riety of plant species contained multiple mismatches 
and/or point deletions, and either displayed low 
activities (<50%) or appeared nonfunctional. Mis- 
matches that reduced transformation frequencies 
most dramatically include, apart from those men- 
tioned above, A5G and C6G. By comparing the se- 
quences of the two different groups of elements, the 
consensus for plant-derived right border alternatives 
(CON2) was determined as 5'-[A/G/G]-[A/C/T]-[A/ 
C/T1-[G/T]-ACNNNNNNA-[G/T]-A-[A/C/THA/G]- 
TCCTG-[C/G/T]-[A/C/G]-N (Fig. ID). 

Plant-Derived Right Border Alternatives Often 
Represent Sequences Involved in RNA Processing 

To determine the possible function of right border 
alternatives in plants, we analyzed the genomic context 
of some of the most effective sequences. Interestingly 
several expressed sequence tag (EST)-derived elements 
were found to be associated with intron-exon junctions 
(Fig. 2A). For example, genomic DNA corresponding to 
the inverse complement of tomato Le02 contains an 
intron positioned between the bases AG and GT, which 
represent the dinucleotides that most frequently flank 
plant introns (Long and Rosenberg, 2000). Associations 
with intron-exon transitions were also demonstrated 
for tomato Le04 and rice OsOl. In the case of Le04, the 
first six nucleotides 5'-TCAGAG are located in an 
intron (with acceptor site underlined), and the sub- 
sequent 19 nucleotides 5 ' -G ATATATTAG AC AGTT- 
TCC are part of the downstream exon of a gene with 
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Rb01(601) 

Rb02 

Rb03 

Rb04 

Rb05 

Rb06 

Rb07 



COM1 

g Ct01(196) 
Sy01(623) 
Sy02 (624) 
Sy03(625) 
Sy04 (€26) 
Sy05(627) 
Sy06(629) 
Sy07<629) 
SyOB (630) 
Sy09(631J 
Syl0(632> 
«yii(S14) 
Syl2(91S) 
Syl3 (916) 



1 10 20 

GTTTACCCGC CAATATATCC TGTCA pTiCSS [AJ237588) 
AATTACAACG GTATATATCC TGCCA pTilS955 (X004 93) 
CATGACAGGA ACATATATCC TGTCA pRiA4D (K03313] 
AATTACAACG GTATATATCC TGTCA pTiTm4 [U8398 7] 
CCTGACCACA ACATATATCC TGTCA pRiA4 [X51338J 
CTAGACAAGG GGATATATCC TGTCA pR12659 (X68345] 
CATTA CTTTA C AATATATCC ..TGTCA pRil724 (AP002086) 
[aAAGACANCA ANATATATCC TGTCA; 
CTTT C GC C C 
GC T TG G .... 



Transformation frequencies 

0% . 50% 



100% 



GTTTACACAA 
CTTTACACAA 
GTCTACACAA 
GTTTAAACAA 
GTTTACACAA 
GTTTACTCAA 
GTTAACACAA 
GTTTACACAA 
GTTTACACAA 
GTTTA CACAA 
GTTTACACAA 
GTTTACACAA 
GTTTACACAA 
GTTTACACAA 



CAATATATCC 
CAATATATCC 
CAATATATCC 
CAATATATCC 
CAAGATATCC 
CAATATATCC 
CAATATATCC 
CACTATATCC 
CAATATATCC 
CAATAAATCC 
CAATATGTCC 
CAATATATGC 
CAATATATCG 
CAATATATCC 



TGTCA 
TGTCA 
TGTCA 
TGTCA 
TGTCA 
TGTCA 
TGTCA 
TGTCA 
TGGCA 
TGTCA 
TGTCA 
TGTCA 
_TGTCA 
AGTCA 



1 StOl* 
" Le01(551) 
St02* (551B 
Ca01(551C) 
At01lS64) 
St03* (60S) 
St04<737) 
Le02(7l9) 
be03* (560) 
X,e04 (724) 
Le05 (609) 
MgOl* (820) 
HV0K570) 
O60K923) 
O602 (924) 
OB03 (925) 
ZtnOl (927) 
TaOl (928) 
Mt01(568) 
Le06(6O3) 
Le07 (S49) 
Le08(722) 
Le09(721) 
Lel0(608) 
LC11 (727) 
HV02<571) 
Lel2(550) 
Gm01(593) 
Mt02(569) 
OS04 (180) 
SC05* (606) 
Lel4(5S4) 
1*15(731) 
1,616(732) 
Lel7(634) 
Lel8* (546) 
Lel9(553) 
Le20(191) 
L«21(725) 
Le22 (726) 
Le23 (S52) 
Le24(548) 
Le25(723) 
Le28(191) 
2R102 (192) 
AB0K193) 
St08* (604) 
L«29(741) 
Le30<720) 
L631M633) 

D CON2 



GTTTACAGTA 
CATTACCAAC 
) CATTACCAAC 
CATTACCAAC 
GTTGACATCA 
GTTTACCTTA 
ACCTACCCCA 
CTCTACCTCT 
GCATACCTCT 
GGAAACTGTC 
ACCTACCCCA 
GTATACCTCT 
ATATACCAAA 
ACTTACTCAA 
CACTACAAAA 
ATGTACGTAT 
GCGTACGCAT 
ATATACGGAG 
GTATACCTCT 
GTTTAGACTT 
TCTTAGAACT 
CGTTAACACT 
GAATTATTTT 
TTGTTCCTGG 
GGTACCATGT 
CACTACCGCC 
GTTCCGGTTG 
TAAAGCAACA 
GATTAGACAA 
CTCTACTACC 
GTTTGACACG 
GTTTACTTGA 
TTCCATAOGA 
TTCTAGCTGC 
GTTGACATGG 
GTTCAGCTTA 
TTCCAGAAGT 
TGATTGC AT C 
ATCCCCACCC 
CA TCC CCACC 
GT CAGGA AGT 
GTTTAAACCA 
AGTTATAAAC 
ATTAACAACG 
AACTTAAAGA 
CTGTACAATA 
TTTTACCCGT 
GATTGCA-TC 
AAGTACC-GA 
CATTGCA-TC 



CCATAT ATC C 
AAATATATCC 
AAATATATCC 
AAATATATCC 
CGATATATCC 
GGATATATCC 
AAATATATCC 
GAATATATCC 
GAATATATCC 
TAATATATCC 
AAATATATCC 
GTATACATCC 
TGATACATCC 
GGATATATCC 
AAATATATCC 
ATATATATCC 
TTATATATCC 
CAATATATCC 
GTATACCTCC 
GTATATATCC 
CAATATATCC 
GTATATATCC 
GCATATATCC 
CCATATATCC 
AGATATATCC 
TCATAGTTCC 
ACATATATCC 
CCATATATCC 
ATATTTATCC 
CGAGATGTCC 
ACATATATAC 
AGATATCACC 
AGAGAAGTCC 
AAATATATCC 
ATGAATATCC 
GCATATATCC 
AGATATATCC 
AAATATATCC 
ATTTATATCC 
ATTTATATCC 
GAATATATCC 
ATATATATCC 
TTATATATCC 
TTAGAAGTCC 
TAAGAAGTCC 
GGACAAATCC 
G-ATATATCC 
AAATATATCC 
TGATATATCC 
AAATATATCC 



TGTCA 
TGGCC 
TGGCC 
TGGCC 
TGTCA 
TGCAT 
TGCCT 
TGCGG 
TGCGG 
TGTGA 
TGCCC 
TGCCG 
TGCCC 
TGGCT 
TGCAT 
TGTGT 
TGTGG 
TGTCC 
TGCCG 
TGCCC 
TGTAC 
TGTAA 
TGTAA 
TGCCA 
TGCTT 
TGCCA 
TGACA 
TGACA 
TGCCA 
TGGCA 
TGCAA 
TATGT 
TGTCA 
TGGCT 
TGTCA 
TGCAT 
TGTTG 
TGCCA 
TGCCA 
TGCCA 
TGACA 
TG ATT 
TGTTG 
TGGCG 
TGCCA 
TGTCG 
CAGCC 
TGCCA 
TGCGT 
TGCCA 



IAY56655S) 
IAW3 98191) 



(SGN-U2049491 
[AL1386 521 

[SGN-U1790663 
[SGN_U1583071 

ISGN-U162952) 
[SGN-U153074] 

[TC137112] 
IAL772425) 
CAP004161) 
[AP0054 96) 
(CF052264) 
[CA500064) 
ICA919279) 
[SGN-U146678] 
[AW093946] 
[SGN-U156238J 
(SGK-U1608003 
(SGN-U1497601 
(SGN-U150174] 
[BE602623] 
[AW618843] 
[AW156235] 
(AC1308O1) 
[AL713937] 

{BI421743) 
(SGN U155057] 
[SGN~U15263 5] 
(SGlfui6033 3] 

IBG131232) 
[AY0132551 
(SGN-U149960] 
(SGN-U1499791 
[AW031162) 
(BI20S800] 
{SGN-U111362] 
(AC144619) 
[AY107199] 
[X75777) 

[AY013255) 
[SGN-U1557451 




"aaagacnnnn nnataaatcc TGCAN 

CCCT G CG GC 

' GTT ■ T TG 
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associated pS!M vector numbers are indicated between paremneseb. ~, & 
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unknown function. Similarly, OsOl defines the tran si- 
tion from exon (5 ' -AGCC AGG ATATATCCTTG A) to 
intron (5 ' -GTAAGT, with donor site underlined) of the 
rice cleavage stimulation factor 77 gene. Another EST- 
identified right border alternative that is associated 
with an intron is LeOl. This element is located imme- 
diately downstream from the intron acceptor site of the 
tomato phosphatase 2C gene. Identical elements from 
potato (St02) and pepper (CaOl) were determined to 
represent similar junctions (data not shown). Interest- 
ingly, all these six right border alternatives were posi- 
tioned in the same inverse complementary position 
relative to the orientation of the gene. The generally 
GT-rich alternative strands contain, apart from AG and 
GT splice sites, binding sites for Ser/Arg-rich splic- 
ing Chancers 5'-[C/G]-[A/GHC/G]-A-[C/G]-G-[A/T] 
(Liu et al, 1998; Reddy, 2004). 

In contrast to the above- described first group ot 
right border alternatives, the position and /or structure 
of a number of other elements did not suggest an im- 
mediate link with transcript processing. As depicted 
in Figure 2B, three sequences were derived from 
untranslated trailers (3'UTRs). The barley HvOl ele- 
ment is located within the 3'UTR of the LepA gene, and 
the Medicago truncatula MtOl and maize (Zea mays) 
ZmOl are positioned in untranslated sequences of the 
AIP gene and a putative 8-amino-7-oxononanoate syn- 
thase gene, respectively. Furthermore, the Arabidopsis 
AtOl, rice Os02/ and wheat TaOl elements are all em- 
bedded within exon sequences of genes. In the case of 
rice Os03, the element is fully embedded within the 
4 3-kb 11th intron of a rice Sec8Aike gene. And nrially, 
some right border alternatives that had beer l PCR 
amplified from plant genomes represented DNAs 
lacking an apparent association with genes. Like the 
previously isolated StOl (Rommens et al., 2004), potato 
St03 appears positioned in the AT-rich intergenic UNA 
region (Fig. 2B). Collectively, our results demonstrate 
that plant-derived right border alternatives can be 
derived from a variety of genetic elements. One group 
of elements is predominantly associated, in the inverse 
complementary (antisense) orientation, with the tran- 
scribed strands of genes. In agreement with the 
transcription direction rule (Szybalski et al., 196b), 
these anti-sense borders are poor in guanine content. 

The Influence of Flanking DNA on the Efficacy of 
Right Border Alternatives 

Test plasmid pSIM551 contained St02 linked to T- 
DNA-flanking sequences of Agrobacterium pTi!5955. 



This arrangement placed the potato right border 
alternative at a distance of 12 bp from the overdrive, 
an element that was reported to promote DNA trans- 
fer (van Haaren et al., 1987). Although the overdrive 
element is believed to function in a position-independent 
manner with respect to the right border (Shurvinton 
and Ream, 1991), we found that a single basepair inser- 
tion between St02 and upstream DNA reduced trans- 
formation frequencies of pSIM579 about 2-fold (Fig. 3A). 
Furthermore, the 5'-CAA trinucleotide insertion of 
pSIM578 had an even greater negative effect on the 
efficacy of transformation, lowering it to 35%. 

To study the molecular basis of the apparent over- 
drive-St02 spacing requirement, we compared the 
pSIM551 sequence with corresponding T-DNA flank- 
ing regions of Agrobacterium plasmids. The aligned 
47-nucleotide sequences generally contained cytosine 
or thymine residues at conserved four-micleotide 
intervals, separated by adenine-rich (46%) trinucleo- 
tide segments (Fig. 3A). This arrangement resulted in 
a high occurrence of AC dinucleotide repeats (27%), 
which approached that of the overdrive element itself 
(42%). Whereas the upstream AC-rich (ACR) domains 
from both Agrobacterium plasmids and pSIM551 
comprised at least six pyrimidine residues at con- 
served positions, the impaired activity of pSIM578 and 
579 was correlated with sequences that contained four 
and five such residues, respectively (Fig. 3A). Addi- 
tional evidence for the importance of correctly spaced 
pyrimidines was obtained by analyzing pSIM580, 
which contained the pentanucleotide 5'-ACCAA in- 
sertion between St02 and upstream DNA. Mainte- 
nance of six pyrimidines at conserved positions in this 
plasmid was associated with the same DNA transfer 
activity as that of the original vector pSIM551 (Fig. 3A). 

To further test the functional significance of cor- 
rectly spaced pyrimidines, the pTil5955-derived se- 
quence of pSIM551 was replaced by the corresponding 
ACR region of Agrobacterium pRi2659 (Hansen et al., 
1992). Immediate linkage with St02 in pSIM844 re- 
sulted in high transformation frequencies (126%; Fig. 
3A). However, disruption of the pyrimidine spacing 
by a single basepair insertion (in pSIM827) almost 
fully eliminated the activity of the right border alterna- 
tive, reducing it down to 7%. 

Having correlated the original spacing of pyrimi- 
dines with efficient DNA transfer, we now also tested 
the functional relevance of adenine-rich trinucleotides. 
For this purpose, the ACR domain of pSIM551 was 
replaced with a tomato DNA fragment carrying nine 
pyrimidines at conserved positions but lacking a high 



Figure 1. {Continued.) Agrobacterium vitis (Rb04). B, Synthetic 

and Rbo2) / Agrobacterium rhtzogenes (Rb03, Rbo4, Rb05, RDOb, J ina kdl *, * Horizontal bars show 

elements. Ct, Synthetic control border; Sy *««»^ alternatives. Sequences that 

transformation frequencies compared to those supported by RbOl \. C ™* "* l Q 0vera „ rignt border/right border 

were isolated by employing PCR/inverse PCR approaches are novated ^^^^JL RbOl control vector and 
alternative consensus (CON2). Transformation frequences are shown as percentages or tnai ot 
represent the mean ± se of at least three experiments. 
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A Le02 in tomato gene (SGN-UI58307) 

S~~ FYRRIYS E intron — V E E I G W E 

r** T>r-rTTr"r & fPrtf &rsrt & TAT ATTCAGAG??^^ ^g-IMsf J| Wi\ 5 ^SSEGTAGAGGAGATAGGATGGGAGCA 

3' 



jCATCTCCTCTATCCTACCCTCGT 



Le04 in (BI929668) 



intron 



GGTtTGT^TGCTTGGGAGAAGCTAGATTCACAG. 
prJU^AO^AAACGAACCCTCTTCGATCT AAGTGTC 



DILDSFRST EFW 
^TATATTAGACAGTTTCCGAAGCACAGAATTCTGGT 
ZTATATAATCTGTCAAAGGCTTCGTGTCTTAAGACCA 



OsOl in rice cleavage stimulation factor 77 gene (AL772425) 
~"r f" M H E P G Y I L E intron 
GCGATTTATGCATGAGCCAGGATATATCCTTGA 
CGCTAAATACGTACTCGGTCCTATATAGGAACT 



GTAAGTTC [134] TGATACAG] 
CATTCAAG [134] ACTATGTC 



Y A D F L C 
^TATGCAGACTTCCTGTGT 
rATACGTCTGAAGGACACA 



LeOl in tomato phosphatase ZC gene (AW398191) 

intro n GQD I FVGKVGD S R AVLAT RD 

3^355^^GCCAGGATATATTTGTTGGTAATGTTGGTGACTCGAGAGCAGTTTTGGC^CAAGAGATA 



B 5 Jefeji ^IjFjTfTSHT^ GCCAGGATATATTTGTTGGTAA 



R H vOl in barley LepA gene (AV93942S) 

— L — E K E V L - trailer 
ACTGGAAAAGGAGGTGCTATGA [278] TGTAACTTGGGCAGGATGTATCATTTGGTATATCTTTTTT [-123] 
TGACCTTTTCCTCCACGATACT [278] A CA TTGA ACC CGTCCTA CA TAG TAAACCA TA TA GAAAAAA f -123] 

MtOl in Medicago Aip gene (new) 

y S P R D A - s* 3* trailer 

TACTCTCCGAGGGATGCTTGA [231] TTTGGAA GGTA TACCTCTGTA TACCTCCTGCCG ACGA TGTC [>174] 
ATGAGAGGCTCCCTACGAACT [231] AAACCTTCCATATGGAGACATATGGAGGACGGCTGCTACAG [>174] 

ZlTlOl in maize oxononanoate synthase gene (CF052264) 

"a — t*" a s K L - s» 3' trailer 

GCTACAGCTTCGAAACTCTAA [52] TGG TTGA CGCGTA CGCA TTTA TA TA TCCTGTGG CGTGACGCCT [>92] 
CGATGTCGAAGCTTTGAGATT [52] A CCAA CTGCG CA TGCG TAAA TA TA TAGGACACCGCACTGCGGA [>92] 

AtO 1 in a putative Arabidopsis gene (AL138652) 

"D H DYGIDDRIYRDVNSSDIIVDHG 

GATCATGACTATGG CATTGATGACAGGATATATCGTGATGT CAACTCGT CAGATATTATAGTAGACCATGGAG 
PTAGTACTGATACCGTAACT ACTGTCCTATATAGCACTACAGTTGA GCAGTCTATAATATCATCTGGTACCTC 

3' s# 

Qs02 in hypothetical rice gene (AP008208) # 

SSSSSPVAAPLQKNISCIQPSVRP 
Tr*fZAGOTCCTClATCGCCTGTAGCGGCAC CACTACAAAAAAATATATCCTGCATT CAACCCTCAGTX3CGCCCCG 
AGCTCGAGGAGTAGCGGACATCGCCGTGGTGATGlVrTTTTATATAGGACGTAAGTTGGGAGTCACGCGGGGC 



TaOl in putative wheat MAP3K alpha 1 protein kinase gene (CK161660) 

DIH IRS N I SCPVS PCGS PL LR. 
GATATCCATATACG GAGCAATATATCCTGTCCA GTTTCTCCATGTGGAAGTCCTCTGCTAAGG 
3CTATAG GTATATGCCTCGTTATA TAGGACAGGTCAAAGAGGTACACCTTCAGGAGACGATTCC 




Qs03 in rice Sec*-like gene (AP008214) 
A I S 
GCTATATCTA 
CGATATAGAT 



GTAAGTT [2782] ATGTACGTATATATATATCCTGTGT [1454 J TTTGGCAG 
CA'TTCAA 1 276 2 \ TACATGCATATATATATAGGACACA [ 1454] AAACCGTC 



SPA 
3TCCAGCCGCA 
2AGGTCGGCGT 



St03 intergenic potato DNA 

itcTiitacttttaatatactgagcatgcaggatatatgctaagctaaacaaataacaactaaacatggaaata 
^Lt>t a t ga aaattatataactca tacqtcctat a tacaattcqatttgtttattgttgatttgtacctttat 

3' 5 ' 

Figure 2 Genomic context of plant-derived right border alternatives. A, Elements (underlined) at intron-exon junctions. Arrows 
indicate the most frequent intron-flanking dinudeotides. B, Right border alternatives positioned within untranslated leaders 
(depicted in gray italics), exons, introns, and intergenic DNA (lowercase). Exon sequences are shown in black with ammo acids 
that they encode in red. introns are boxed. The number of basepairs omitted from the text is shown between brackets. Accession 
numbers of genomic clones, if available, are shown between parentheses. 
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5' 



+1 



Right Border/RBA 



P SIM55I CTTAGAGATC T£ ^QAAAW*»^^^ (100 ± 5) 

pSIM579 TTAGAGATCTCAAACAAACAJACA^^ < 48 *• X) 

pSIM578 AGAGATCTCAAACAAAOVT^^^ (3S * 4) 

P Til5955 AGAAACAATCA^CAAA<^JACA^^^ < ND > 

pTiCSS GCCX^ITTTAAATATCCGATTATTCTAATAAACGCTCTTTTCTCTTAGGTTTACCCGCCAATATATCCTGTCA (ND) 

P Ri2659 TGACGA A CTOACGAACTX^CGAACTGAOjAACTCACGAA C^GACGAACTAGACAA (ND) 

pRiA4 TAA CAMTgAACgttTG^^ (ND) 

P Ri8l96 TAG^ATTGCA^TCCAAAGGCAGGCACGTACAAAC^AATTTATTTAGCCGACAACGGAATATATCCTGTCA (ND) 

pRil724 GAAGGC^CGAAGGCACGAAGGCACGAAGGCACGAAGGCA CGAAGGCACATTACTTTAGAATATATCCTGTCA (ND) 

pRiDBIR TCATCAC^GCCGTCCMAACAAACATACCTCCACACAAATTTATCTACCTGACCACAAGATATATCCTSTCTi (NO) 

P S1M580 AGATCTCAAACAAACATACACAGCGACTTATTCACAACTAGTACCAACATTACCAACTiAATATATCCTGGCC (120 _. 

P SIM844 TGACGAACTGACGAACIX3ACGAACTGACGAACTGACGAACTACCJVAACATTACCAACAAATATATCCTGGCC (126 ± 5 

P S1M827 CTGACGAACTGACGAACTGACGAACTGACGAACTGACGAACTACCAACATTACCAAC^ (7 ± 2) 

ACR domain 



5) 





240 «r 




220 - 




200 - 


J 


180 - 








160 - 




140 ■ 




120 - 




-1000 



1000 



c 

OD 

pTiC58 

pTi 15955 

pRi2659 

P Ri4 

pRi8196 

pRi!724 

pSIMl(>8 
pSlM551 
pSlM920 
pSIM78l 
pS!M582 
pS!M793 
pSIM843 



CA AACAAACATA CACAGC(^CTTA 

+ G TCAG CATC ATCACACC AAAAGTT AG G^CCGAATA^TTTGAAATTAGAAAGCTCGCAATTGAG 
AAC ACTGATAGTO TAAACCGAAGGCGGGAAACGACAATCTGATCATGAGCGGAGAATTAAG^ 
AATAACAATCT CATGTTAG<^ AATAATATCACCCAATCAAQGCGGCCACGCAATTAACCTGGT 
GTCAATCAGCAAACAGC A^GTAGTT ATTGTCTGTGAAGATATGTAGGTACCTTTCACCCAC 
G CACTAATATA AGAAATGTCCTGTCAGCACTAATATAAGAAATGTCCTGCTTTGACGCAAGTG 
AA ^TATTCGTT AATAGGGACGTCGTACCTACTTCCCTTCCAGCGCAGCAAACAGTAGGTCTO 

a a t^T ATAGAGGCATGACTGG CATG AT ^CT AAATTG ATG CC CACAG AGGAGACTTATAAC CT 
rcftflfC CGGT ACCCGGGGAT C AATT C CCG AT CT AG TAAC ATAGATG ACACCGCGCGCGATAATT 
GGGCCCGGTTCCCGGGGATCAATTGGGCCCGGTACCCGGGGATCAATTCCCGATC^AG 
GGGCCCGGTACCCGGAGGAGACTCCGATCTACGGCGCCAAATTCAAGGACGGAGAACTTCOTG 
CTGAGGACATTC A G AAGATTGGT TATATCCTCTTTC AAGACG CTAAGCAATTAGTG ATG CAAA 
G AGG TAT AG AGGCATGTCTGG^TGATCACTAAATT^ATGCCCG CAGAGGGG ACTTATAACAT 
GGGGCCCGGTA CCCGTTAG^ CTAG C^GAAAGGG CC^CGGGCAGCCCGTTAGCXrCX3CATAAC 



(100 ± 4) 

(ND) 

(ND) 

(ND) 

(ND) 

(ND) 

(112 ± 5) 
(100 ± 5) 
(60 ± 4) 
(89 + 4) 
(59 ± 4) 
(105 ± 4) 
(168 ± 12) 



Figure 3. Sequences flanking right border alternatives. A, Upstream sequences display a conserve* I organization ^^ine/ 
thymine residues separated by adenine-rich trinucleotide spacers. The overdrive sequence of pTll 5955 .s underl.ned (dotted). 
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percentage of adenine residues in the intervals. The 
resulting vector pSIM581 displayed only 22% of the 
transformation efficacy of pSIM551, indicating that 
adenine-rich intervals and/or AC repeats play a role 
in the functional activity of the ACR domain (Fig. 3 A). 

Because adenine-rich DNA is often associated with 
low helical stability regions, we determined the helical 
Sim profile of pSIM551 using WEB-THERMODYN 
(Huang and Kowalski, 2003). This analysis identified 
a 120-bp sequence immediately upstream from the 
St02 cleavage site, and including the ACR domain, 
to represent the lowest helical stability region of 
the P SIM551 backbone (Fig. 3B; data not shown). The 
association of an easily unwound DNA region imme- 
diately upstream from the right border alternative 
may be functionally relevant because Agrobacterium 
Ti and Ri plasrnids contain similar low helical stability 
regions at their right borders. For instance, pTiC58 
contains a 120-bp region preceding the border with 
a stability of 116 kcal/mol. Analogous to the associa- 
tion of low helical stability regions with the initiation 
of plasmid replication (Natale et al., 1993), these 
upstream DNAs may be involved in the initiation of 
DNA transfer. 

Given that upstream DNA sequences adjacent to 
St02 influenced transformation efficacy, we sought to 
test the effect of downstream modifications. As shown 
in Fieure 3C, sequence analyses identified decamers 
that Shared the consensus 5'-[A/C/T]-[A/C]-[A/C/ 
T]-[A/G/T]-[A/T]-T-[A/C]-G-[G/T]-[G/T] with the 
5' part of the overdrive, and were positioned at 
a distance of one to 27 nucleotides from the right 
border This "downstream from right border" (DR) 
domain was also identified in both the potato-derived 
T-DNA (Rommens et al., 2004) of P SIM108 and test 
vectors such as P SIM551 (Fig. 3C). An increase m the 
spacing between LeOl and DR domain from 24 nu- 
cleotides in pSIM551 to 48 nucleotides in pSIM920 
lowered transformation frequencies by 40 /o (i*ig. 
indicating that the supporting function of the DR 
domain on border activity is spacing dependent. 

Because downstream DNA sequences represent the 
actual T-DNA that is intended for plant transforma- 
tion we replaced the original bacterial sequences of 
PSIM551 with two unique potato DNA fragments. The 
pSIM551-derivative pSIM793, which contained a DR 
domain at 27 nucleotides from LeOl, yielded about the 
same transformation frequency as pSIM551. In con- 
trast the potato DNA fragment of pSIM582, which 
contained a DR domain with several mismatches to 



the consensus, displayed only 59% activity. Interest- 
ingly, replacement of LeOl-flanking DNA sequences 
by an alfalfa DNA fragment that contained two 
different DR domains triggered unusually high trans- 
formation frequencies for the resulting vector pSIM843 
(168%; Fig. 3C). We conclude that sequences flanking 
right border alternatives play an important role 
in supporting plant DNA transfer. These sequences 
comprise upstream ACR and downstream DR do- 
mains. 

Substitution of Left Borders by Right Border Alternatives 

The above-described studies had shown that CON2- 
matching 25-bp elements function as effective right 
border alternatives if flanked by sequences tnat sup- 
port their activity Would it also be possible to /use such 
elements as left border alternatives? The answer to this 
question was not immediately obvious because the 
intended function of left borders, namely secondary 
cleavage of the T-DNA, is different from that of right 
borders, which mediate primary cleavage reactions. 
As shown in Figure 4A, this functional difference is 
associated with a divergent sequence organization at 
and around the border sites. In contrast to right 
borders, left borders (1) are preceded by AT-rich 
DNAs comprising an "upstream from left border" 
(UL) domain on either DNA strand with the consensus 
sequence A[C/T]T[C/G]A[A/T]T[G/T][C/T][G/T][C/ 
G1A[C/T][C/T][A/T], (2) share a more conserved con- 
sensus sequence, 5'-[A/G]TTTACA[A/C/T][A/C/T][A/ 
C/T] [C/G]AATATATCCTGCC[A/ G], and (3) are linked 
to downstream plasmid backbone DNA by C-clusters that 
conform to the consensus CCN 1 . U CCN 1 . 11 CCN 1 . 11 CC. 

Direct evidence for the role of the C-cluster organi- 
zation in supporting left border activity was obtained 
by comparing the fidelity of DNA transfer for pSIM831 
and 829. Both vectors contained an expression cassette 
for the nptll gene preceded by DNA regions compris- 
ing St02 as right border alternative, and were con- 
firmed to support the same high transformation 
frequencies as pSIM551 (data not shown). The vectors 
also contained almost identical DNA regions for 
secondary cleavage, which differed only in that 
pSIM829 contained a 10-bp insertion in the fourth 
left border-associated C-cluster (Fig. 4B). The effect 
of this small change was assessed by classifying 
regenerated shoots in three groups based on PCR 
analyses. The first "T" group only contained the 
intended T-DNA, and would therefore be predicted 



K ^^Kitated with gray arrows. Transformation efficacies are shown between parentheses as percentages of 
Direct 'epeats a e ' n °™ + * Q y f lhree experiments. *'+ 1 " indicates the position of the first base of the right border or nght 
bo^ shinty profile (kcal/mol) across the extended 2-kb St02 region o pSIM551 

Tin !hn Tf!n ci« and 1 20-bo window size. C Downstream sequences comprise a DR domain (red wfth mismatches in black). 
™ Z ? A 9 ^^J^^TbT^3 ^B43 contain DNA fragments from a potato homolog of AY566555, a potato 
J2£J ^ truncatula AC13 10 26, respectively. Plasmid P 5IM582 contains LeOl 

S hlKjA sequence tha'flanks the element in its original genomic context. Transformation frequences 
«tMlS« parentheses as percentages of controls and represent the mean ± se of three experiments. 
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Transformation-% 

T TB B 



pTil5955 2^ 

tctccatattgaccatcatactcartgctgarccatgtagatttcccggacatgaagcc 
atttacaattgaatatai^tg^|g|^ct(^^ctttgca^c§ 

pTiC58 



Tr:&aTTCAGTACATTAAAAACGTCCGC^rCTgTTATTAAGTT GTCT^ GCGTCAATTT 
nTTTAC^jc^CAATATAlj^ 



pRi2659 ^ 

ATCTGGTAATATAGCAAAAACGrgC TCAAA AATCGCrrCAAAGCTCTTGTACTTAGCTC 
GTTTACA^gACAATATATjc^r G|cg A |CCCC| 

pRiA4 



TACATTTTATATT CGATAAA GCA TG CG JT AAAACGAC TTCGCATGT CCATATCTAATCT 
GTTTACATC^CAATATATjccfr^ 

B 

pSIM831 ^ 

aaatctgattgataaaggatcgarcctctagagtcgacctgcagtacttacgtacaatt 
gttta c^53acaatatat|^tgcca[c^ggatatattc^^taggag^^ac^^aacagct 

|cccc| GA|cq 

pSIM829 ^ 

AAATCTGATrGATAAAGGATCGATCCTC^GAGTC^^ 
GTTTACA|j^C^TATA^T^^ 

c 

pSIMlOS 



TCCTTCATAGCTACACTTTCTAAAGGTACG A TA GA TTTTGGATCAAC CACACACACTT 
CTTTAC^^TATAT^^ 



pSIM843B __ 

gtaaaaaataaaagtgaaaa ttcaa 3t3aattaacacaaatataaatgt aatata^aatt 
gtataIc^ctgtatacm|^ 



pSIM84 9 

aatggaggtaagtgtttctgctcagtgctgatag atgt aaatat^ctgttatgaagc^ 

GTAl^j^CTGTATAC^T^^ 
PSIM781 

TGTTGAAGGCTTGGATGTGATTA AGAAGGC CGAGG CTGT TGGATCTAGTTCT^ 

catta[c^caaatatat|^ 



26 + 8 


55 ± 9 


19 ± 6 


ND 






ND 






ND 






41 ± 3 


52 ± 5 


7 ± 2 


17 ± 2 


68 ± 5 


15 ± 4 


41 ± 5 


48 ± 6 


11 ± 3 


34 ± 4 


53' ± 6 


13 ± 2 


10 ± 2 


83 ± 7 


7 ± 2 


11 ± 1 


58 ± 3 


31 ± 2 



Figure 4. DNA sequences flanking left borders and left border alternatives. A, Left border region ofji and Ri plasmids. B, 
Modified left border regions. C, Regions comprising StOl, MsOl, or St02. Upstream DNA is shown in blue with UL domains 
indicated in bold. Left borders and left border alternatives are highlighted in gray. C-clusters are boxed. Frequencies of transgenic 
plants containing the designated T-DNA delineated by borders or border alternatives ("T"), the T-DNA still attached to backbone 
sequences ("TB"), and backbone only ("B") are shown on the right and represent the mean ± se of three experiments. ND, Not 
determined. 



to have arisen from primary cleavage events at the 
right border followed by secondary cleavage at the 
left border. Plants containing both the T-DNA and 
additional backbone DNA sequences were classified 
in a second "TB" group, and most likely repre- 
sented events where the second copy of the border 
alternative failed to function in terminating DNA 
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transfer. The third "B" group of events only con- 
tained backbone DNA and probably arose from 
initial cleavage reactions at the second St02 copy. 
This genotype classification demonstrated that 
pSIM831 was more than twice as effective as 
pSIM829 (41% versus 17%) in producing "T" events 
(Fig.4B). 
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Efficacy of right border alternatives as sites for 
secondary cleavage was studied by testing pSIM108 
and 843B. The vectors contained StOl and MsOl, re- 
spectively, as right border alternative. The down- 
stream region of pSIM108, shown in Figure 4C, 
contained a second copy of StOl inserted between (1) 
AT-rich (62%) DNA derived from the terminator of the 
potato ubiquitin-3 gene (Garbarino and Belknap, 1994) 
containing a UL domain, and (2) plasmid backbone 
DNA comprising five C-clusters. Similarly, the DNA 
region intended for secondary cleavage in pSIM843B 
contained a second copy of MsOl preceded by an 
AT-rich (87%) alfalfa DNA fragment and followed by 
downstream C-clusters (Fig. 4C). The binary vector 
pSIM401, which contained the extended left border 
region of pTiC58, was used as control. PGR genotyping^ 
demonstrated that both pSIM108 and 843B yielded 
even higher frequency of backbone-free transforma- 
tion events (41.1% and 33.9%) than obtained with the 
control (26.0%), thus indicating that right border 
alternatives can be used to replace left borders. How- 
ever, subsequent experiments also showed that such 
replacements need to be carried out cautiously. A 
modification of pSIM843B that both eliminated the UL 
domain and altered the spacing of C-clusters dramat- 
ically lowered the frequency of desired "T" trans- 
formation events for the resulting vector pSIM849 to 
10.2% (Fig. 4C). This reduced frequency was associ- 
ated with an about 2-fold increased transfer of DNAs 
that are still attached to their vector backbones, in- 
dicating that the modifications of flanking DNA in- 
terfered with effective secondary cleavage at the 
second MsOl copy. Similar alterations of DNA sur- 
rounding the second StOl copy of pSIM108 resulted in 
an almost 4-fold reduced transformation efficacy of the 
resulting vector pSIM781 (Fig. 4C). 

Collectively, our data demonstrate that right border 
alternatives can be used to replace left borders if 
associated with the upstream UL domain and down- 
stream C-clusters. Even small changes in this organi- 
zation were found to have a profound effect on the 
frequency of backbone-free plant transformation. Re- 
placement of the internal nptll gene expression cas- 
sette of pSIM843B by alfalfa DNA would make it 
possible to produce intragenic alfalfa plants. 



DISCUSSION 

This article describes 14 new and functionally active 
right border alternatives from dicotyledonous (potato, 
tomato, pepper, alfalfa, and Arabidopsis) and mono-' 
cotyledonous (rice, barley, and wheat) plant species. 
Because analyses of the 125-Mb haploid Arabidopsis 
genome and 3-fold larger rice genome uncovered 
a single Arabidopsis element and three elements in 
rice, the occurrence of right border alternatives was 
estimated at a frequency of at least 0.8 X 10~ 8 . Thus, 
the 8-fold larger tomato and potato genomes are 
expected to each contain more than eight such se- 
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quences, several of which are described in this article 
Continued plant genome sequencing and data mining 
efforts are anticipated to result in the identification of 
many additional right border alternatives from these 
and other plants. Alternatively, it is possible to link the 
sequences of at least two different plant ESTs together 
and create new synthetic right border-like elements 
We have tested two such elements, S'-GGACAGC A 
TATATAA AGTGTAAAAC and 5'-GGACAGGATA 
TATAAAGTGTAAAAC, the sequences of which were 
recently disclosed (www.crop.cri.nz). In our tobacco 
transformation system, these elements supported 86% 
and 7%, respectively, of the transformation efficacv of 
conventional right borders (CM. Rommens, O Boueri 
and J.M. Humara, unpublished data). ' 

In addition to plant-derived right border alterna- 
tives, such elements were also identified in fungi and 

T£3fif F ° r instance - m ouse accession number 
AC110541 contams the sequence 5'-AGGCAGGACT 
TAATGTGGTGTAAAC. With the discovery that Aero- 
bacterium can be used for transformation of a wide 

XfSH^^ Sp6CieS (Kunik et aL ' 200 1; Casas-Flores et al 
2004), the new right border alternatives may be used to 
extend the concept of all-native DNA transformation 
(Rommens, 2004) to eukaryotes other than plants 

Many of the plant-derived right border alternatives 
described in this article are associated with transcribed 
DNA. These sequences represent untranslated trailers 
introns, exons, as well as intron-exon junctions. The 
latter group of elements was identified most frequently 
and suggests a function of the GT-rich antisense border 
sequences in intron splicing. Because guanines and 
thymines each have a choice of two bases to pair with 
guanines with cytosines and thymines, and thymines 7 
with adenines and guanines, the antisense borders are 
ideal for secondary structure formation, which is gen- 
erally believed to play an important role in pre-mRNA 
splicing (Patterson et al., 2002). Interestingly, we found 
the inner part of the LeOl intron of the phosphatase 2C 
gene to share homology with introns of many other 
genes, such as a tomato permease gene, a Sue synthase 
gene, a TPR domain-containing protein gene, and a 
receptor kinase-like protein gene from potato (CM. 
Rommens, unpublished data). Although it is conceiv- 
able that such introns evolved from ancient insertions 
of the same mobile intron (Long and Rosenberg, 2000), 
alignment of corresponding intron-exon junctions did 
not reveal any conserved sequences. Thus, LeOl does 
not appear to represent the remnants of an ancient 
proto-splice site. Sequences that flank the right border 
alternatives in their original plant genomic context 
Jacked homology with known bacterial DNA (CM. 
Rommens, unpublished data), indicating that these 
regions also did not originate from early bacterium- 
mediated transformation events. 

Efficacy tests of border-like sequences made it 
possible to define a consensus sequence for right 
border alternatives. This consensus allows consider- 
able degeneracy, especially in the 5' terminus, and 
confirms the previous notion that mismatches with 
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Agrobacterium right border sequences are often toler- 
ated (Rommens et ah, 2004). It also suggests that both 
right borders and right border alternatives contain two 
separate domains, a possible downstream binding site 
for virD2 preceded by a domain that may function 
as site for an accessory protein. Because the latter do- 
main resembles the relaxed consensus binding site of 
the eubacterial initiator protein of DNA replica- 
tion (DnaA), 5'-[C/T]-[C/T]-[A/C/T]-T-[A/C]-[C/ 
T]-A-[A/C]-A (Roth and Messer, 1998; Speck and 
Messer, 2001), this or a related protein may be involved 
in supporting local DNA unwinding. 

The sequence immediately upstream from right 
borders and right border alternatives represents a low 
helical stability region that may be involved in DNA 
unwinding as well. In Agrobacterium plasmid 
pTil5955, this region contains a 25-bp sequence (over- 
drive) that has been reported to promote right border 
activity (van Haaren et al., 1987). Our work demon- 
strates that the overdrive is part of a larger ACR 
domain that is conserved among Agrobacterium Ti 
and Ri plasmids. We found the enhancing activity of 
this region to require at least six pyrimidine residues 
at conserved positions. The sequence and spacing 
requirements of the ACR domain may explain why 
certain previously characterized Ti plasmid mutants 
displayed reduced virulence (Shurvinton and Ream, 
1991). For instance, the impaired activity of mutant 
WR1803 is associated with an insufficient number of 
pyrimidine residues at conserved positions, and the 
low efficiency of WR1804 is linked to a lack of adenine- 
rich spacers. This weakly conserved repetitive and 
spacing-sensitive character of the ACR domain shares 
a similar architecture with the repeat region that is 
linked to the replication origin of, for instance, 
repABC-type plasmids. Like these origin-associated 
repeats, the ACR domain could play a similar role in 
DNA unwinding (Jakimowicz et al., 2002; Betteridge 
et al., 2004; this work). A candidate protein facilitating 
strand separation in Agrobacterium plasmids is virCl. 
Because the overdrive sequence has been shown to 
function as binding site for virCl (Toro et al., 1989), it is 



possible that virCl also binds to the broader sequence 
context of the various AC-rich regions. The function of 
virCl has not been elucidated yet, but homology with 
bacterial parA partitioning proteins suggests that 
virCl may bind single-stranded DNA to extend un- 
wound DNA regions (Easter and Gober, 2002). 

Apart from the presence of potentially DNA un- 
winding sequences within extended right border re- 
gions, the AT-rich sequences upstream from left 
borders and left border alternatives represented low 
helical stability regions (CM. Rommens, unpublished 
data). Thus, the facilitation of local DNA unwinding 
may not just be important for initiation but also for 
termination of DNA transfer in a similar way as 
described for conjugati ve DNA transfer.. YRran et al 
2004). In the case of pTiC58, this AT-rich DNA contains 
the sequence 5 ' -TTC AGTACATTA, which resembles 
the putative binding site for integration host factor 
(JHF) binding, S'-fA/TJ-TCAANNNNTTA (Friedman, 
1988; McGuire et al., 2000). Because IHF is involved in 
virtually all forms of nucleoid manipulation including 
DNA unwinding (Ryan et al., 2004), it is possible that 
this protein supports DNA cleavage at left borders in 
a similar way as reported previously for oriT (Karl 
et al., 2001). 

Agrobacterium left border regions can also be char- 
acterized by the presence of at least four C-clusters at 
the junction between the borders and upstream DNA. 
The C-clusters may form into tertiary quadruplexes 
at slightly acid or neutral pH, in a similar manner 
as described for mammalian C-clusters (Neidle and 
Parkinson, 2003; Zarudnaya et al., 2003). It is possible 
that the complex folding associated with C-clusters 
negatively affects processes that are associated with 
DNA transfer, possibly limiting the frequency of back- 
bone DNA transfer. The organization of extended 
regions for the initiation and termination of DNA 
transfer is summarized in Figure 5, accounting for 
domains of local DNA context around both the primary 
and secondary cutting sites described in this work. 
These sequence requirements should be considered in 
efforts to create novel all-native P-DNA vectors for 
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Figure 5. General organization of extended border 
regions. Putative sites for DnaA and IHF are indicated 
with open vertical arrows. Right and left borders are 
shown as yellow boxes. The positions of cleavage 
sites are depicted with triangles. The direction of 
DNA unwinding is indicated with a dashed horizon- 
tal arrow. 
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MATERIALS AND METHODS 

PCR-Based Isolation of Right Border Alternatives 

Plant DNAs (2 /xg), partially digested with SatdUA, were ligated with 
192-bp BawHI-EcoRV fragments of pBR322. The resulting DNAs were used 
as templates for amplification with the primer pair 5'-YGR CAG GAT ATA 
TNN NNN KGT AAA C-3' (degenerate border) and 5'-GAC CAC ACC CGT 
CCT GTG-3' (anchor primer), with 49°C annealing temperature and 
2 5-min extension time. Subsequent PCRs were performed with the amplified 
DNAs ligated with pGEM-T as templates using the border primer together 
with either SP6 or T7 primers at a slightly higher annealing temperature 
(52°C) The products of these reactions were inserted into pGEM-T and 
^quenced to design primers for conventional inverse PCRs to determine the 
actual sequence of the putative right border alternative. 

Database Searches and Sequence Analysis 

Publicly available databases, including those maintained by the National 
Center For Biotechnology Information, were searched for T-DNA border 
sequences using the Motif Alignment and Search Tool (BaUey and Gribskov, 
1998) and advanced BLASTN with penalty for nucleotide mismatch set at -1 
and expect value at 10 s (Altschul et al., 1997). Additional databases that were 
searched include those covering Solanaceae (http://www.sgn.corneU.edu/), 
Compositae (http://compositdb.ucdavis.edu/), and Medicago truncatula 
(http //www.genome.ou.edu/medicago.html). Folding of single-stranded 
DNA was studied using both the Mfold Web server (Zuker, 2003) and the 
Bielefeld Bioinformatics Server (Giegerich et al., 2004). Free energy profiles for 
unwinding 50-bp windows of double-stranded DNA across Ti plasmids were 
estimated using WEB-THERMODYN (Huang and Kowalski, 2003). 

Plasmid Construction 

The plasmid that was used for construction of pSIM-T vectors comprised 
unique Kvnl and Spel sites flanked by sequences upstream from the right border 
of P Til5955. Details are described elsewhere (Rommens et al., 2004). Both nght 
borders and right border alternatives were inserted into this plasmid by 
Heating linearized plasmid DNA with annealed primers comprising the 
sequences of interest flanked by the sticky ends of Kpnl and Spel, respectively. 

Isolation of Plant DNA Fragments 

The following primers were used to amplify DNA fragments from plants: 
5'-TTATGC GGG CTA ACG GGCTG (forward) and 5'-GGG CCC GGT ACC 
CGT TAG GGC TAG (reverse) for the alfalfa (Medicago sativa) DNA fragment 
downstream from MsOl of pSIM843; 5'-GGG CCC GGT ACC CGG AGG AGA 
CTC (forward) and 5'-GAA GAA GCA CAC CGG CAC TGG AATT (reverse) 
for the potato (Solanum tuberosum) DNA fragment downstream from St02 of 
PSIM781- 5'-GAG GTA TAG AGG CAT GTC TGG CGT GAT C (forward) and 
5'-GTG AAG TTr ATA ACA TGT TGA AGG AGC TCC A (reverse) i fa r the 
potato DNA fragment following St02 of P S1M793; 5*-GGT ACC CTC TGT 
TGA CCA GGATAT G (forward) and 5'-ACT ACT CAT TAC CAA CAA ATA 
TAT CCT GG (reverse) for the tomato (Lycopersicon esculentum) DNA fragment 
downstream from LeOl of pSIM582; 5<-CTT ACG TAG AAT TCT GTG CCA 
TG (forward) and 5'-GTT GGA TCT AGT TCT TGA AGT T (reverse) for the 
potato DNA fragment upstream from the second StOl copy of P SIM781B; and 
5' -AAT TTT ATA TTA CAT TTA TAT TTG TG (forward) and 5 '-CAT AAC 
AAA AAA AAA TTC TAT AAA TTA T (reverse) for the alfalfa DNA fragment 
preceding the second copy of the MsOl element of pSIM843B. 

Plant Transformation 

Binary vectors were introduced into Agrobacterium tumefaciens LBA4404 
cells as follows. Competent LB4404 cells (50 M U were incubated for 5 mm on 
ice in the presence of 1 /tg of vector DNA, frozen for about 15 s in liquid 
nitrogen, and incubated at 37*C for 5 min. After adding 1 mL of liquid broth, 
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the treated cells were grown for 3 h at 28°C and plated on liquid broth/ agar 
containing streptomycin (100 mg/L) and kanamycin (100 mg/L). The vector 
DNAs were then isolated from overnight cultures of individual LBA4404 
colonies and examined by restriction analysis to confirm the presence of intact 
plasmid DNA. 

A 10-fold dilution of an overnight-grown Agrobacterium culture was 
grown for 4 to 5 h, precipitated for 15 min at 3,800 rpm, washed with 
Murashige and Skoog liquid medium (PhytoTechnology, Shawnee Mission, 
KS) supplemented with Sue (3%, pH 5.7), and resuspended in the same 
medium to an optical density at 600 nm of 02 (for evaluation of new borders 
using pSIM-T vectors) or 0.04 (to assess the efficacy of new border-flanking 
DNA sequences). The suspension was then used to infect leaf explants of 
3-week-old in vitro grown tobacco (hiicotiana tabacum) plants. Infected tobacco 
explants were incubated for 2 d on coculture medium (one-tenth Murashige 
and Skoog salts, 3% Sue, pH 5.7) cc^taining 6 g/L agar at 25°C in a Perdval 
growth chamber (16-h-light photoperiod) and subsequently transferred to 
M40l/agar (PhytoTechnology) medium containing timentin (150 mg/L) and 
kanamycin (100 mg/L). Two weeks after inoculation, the number of calli per 
leaf explants was scored, and averages of all the explants for each of the 
treatments were calculated. The tobacco calli assay is highly quantitative 
because individual calli can be -Tinted, and the large sample size (at le?st 
50 different explants /infection) permits statistical analysis. 

Plant Genotyping 

Plant DNA isolation and subsequent PCR were carried out as described 
previously (Rommens et al., 2004). 
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Exhibit 3 

Example of BLAST results for the query sequence CGACAGGATATATTTGTTGGTAATC 
that represents the border motif YGRYAGGATATATWSNVBKGTAAWY. The border-like 
sequences are underlined. 



Database: Database of GenBank+EMBL+DDBJ sequences from EST 

>gi I 25018487 | gb I CA517930.il KS10002H08 KS10 Capsicum annuura cDNA, mRNA sequence. 
Query 4 CAGGATATATTTGTTGGTAAT 24 

I I I I I I I I I I I I I I I I I I I I I 
Sbjct 298 CAGGATATATTTGTTGGTAAT 318 

CGAGATTATCTTCCAATTTTTCTGCGATCAGAGTGGGAAACTAAATCCTATGGTGATCAAAATGCTGTGTCTGAGAACGGAAATACTAATGGAG 
GCTTGCATTTTGACGACCTTGTGGATGATGATTTGGTCGAATCAGTGGAAGCTCAGAACAATGAAAATTTCCCAGAAATCCATCTGCCTCTTAA 
GAGGTCAATATTAAGGGCTTTTAGATCAATGGATAAGGAACTGAAGTTACACCCCTCAATTGACTGTTTCTGCAGCGGGTCAACTGCTGTTACT 
TTGGTAATGCAG GGCCAGGATATATTTGTTGGTAATG TTGGCGACTCGAGAGCTGTTTTGGCAACAAGAGATAAAGACGACAACTTAATGGCCG 
TGCAGTTGACAGTAGATTTGAAGCCTAATCTTCCAAGAGAAGCTGCTAGAATCCATAAGTGTAAAGGAAGGGTTTTTGCACTACAAGACGAACC 
AGAGGTTGCACGTGTGTGGTTGCCAAAC 

>gi| 16248687 | gb[BI934215. 1 ! EST554104 tomato .flower, anthesis Lycopersicon esculentum cDNA 
clone CTOD19K21 5' end, mRNA sequence. 
Query 4 CAGGATATATTTGTTGGTAAT 24 

I I I I I ! I I I I I I I I I I I I I I I 
Sbjct 413 CAGGATATATTTGTTGGTAAT 4 33 

AACCCTATGTGGAGTATTTGATGGGCATGGTCCTCATGGTCATATGGTTGCCAGTAAAGTTCGAGATTATCTTCCAATCTTGCTGCGATCAGAA 
TGGGAAACTAAATCCTGTAGTGATCTAAATGATGTATCCAAGAATGAAAATACTAATGGAGGCTTACATTTTGACGACATTGTGGATGATGATT 
TGGTTGAATCAGTGGAAGATGAGAACAATGTACAATTCCCAGAAATCCATCTACATATTAAGAGGTCAATATTAAGGGCTTTTAGATCAATGGA 
TAAGGAACTGAAGTTACACCCCTCAATTGACTGTTTCTGCAGCGGATCAACTGCTGTTACTTTGGTAAAGCAGGATATGATGTTG7UUVCATTAT 
ACTTTCAGAAAATAATATCAATTGAGTTGGAAG GGCCAGGATATATTTGTTGGTAATG TTGGTGACTCGAGAGCAGTTTTGGCAACAAGAGATA 
AAGACAACTCCTTAATGGCTGTACAGTTGACAGTAGATTTGAAGCCTAATCTTCCAAGTCTGTTTGTCTCTTTCTCTTACCCGCCCCACCCAAC 
ACAGTGCACATTAGCTGAAGCTGCACTTATATTTATTTCAATTGTTTGAACTCAGTAGTTTCTGTCTTCTGCTTGTAGGAGAAGCTGCTAGAAT 
CCATAAATGTAAAGGAAGGGGGTTTTGCACTGCAAGATGA 

>gi I 78085038 | gb | DV513431.il ZM_BFb0194 J20 . f ZM_BFb Zea mays cDNA 3 ? , mRNA sequence. 
Query 4 CAGGATATATTTGTTGGTA 22 

I I I I ! I I I I I I I I I I Ml 
Sbjct 178 CAGGATATATTTGTTAGTA 196 

AAACGGAACACGACGTGTGTACAATTTTTGCAATCCTTCATACACCAACCACACCTGGCAAAACCTCATGGAAGGAAGGAAATCATCAACCCAA 
ATGACTGAAAAATTGGGAGGCACTGGTCAAGCCTTCTACTCTGCTCTACTAGTCTAAGCTGCTGCT 

TGTTAGTACTT AGTACGGTACGGTGGTGCTCCTTCTAGTCACTTGTTCCTCTGGTGGTGCTGGGCTGTGGCGGTGGCGGCGGCTGCTCTGCTCG 
TTTGGTCTCTGCTCTTGAGGGCAAAGCAGCATGCTGCGGAGGTGGTTATTGGTCATCATGCCAGGCGCAATTGACACTGACAAGGTTAGTACGA 
GCTCCGTCATCTCCTCACACATCCCTACTGTCGCGATCGGACGGCTCAGACGGTCGCGGCCGGAAGACCACGGGCGCCCCGTACTGCGGGTGGA 
GGAGCATCACCACCGTCGGCGCGTGGAGGTAGCTGGGCCACACCCGGAAGTCCAACCGCTGGAGGACGATGGCCAGCGTGAGCTTACACTCCAG 
GAGCGCCAGGTTCTGGCCGATGCACATCCCCCCGCCCAGCC 

>gi I 73933229 | gb | DT516539.il WS02431 . B21_L08 PTxD-ICC-N-A-14 Populus trichocarpa x Populus 
deltoides cDNA clone WS02431_L08 3', mRNA sequence. 
Query 4 CAGGATATATTTGTTGGTAAT 24 

I I I I I I I I I I III I I I I I I 
Sbjct 404 CAGGATATATGTGTCGGTAAT 384 

TTTTTTTTTTTTTTTTTTATCGCAATTGTGATTAAATTAATAATCATTGTGATAGAATCAAAATCCTTCCATGGATGCATATTAATCAAATGAA 
TTAATAAAGACTTGGAGCAAGCATGCAAGATACACAATGCATGAATTAAGCGAGATCCCCTCCACATATTCCGTTGGAGGCAGGTGTCTGGCCC 
CACCGGAAGCGACAGCGGCGGCAACAACGACCTCTAAAATCTCTTGGTAGGTCCGCCGTAACTAAAGTCTGTCCCCTCTCAGTCTCCTCCACGC 
CTCCCTCTCTTTACCTCTATATAACAAAGCCCTTGAGGATCTGATGTACCT^ATCAAAACAAGATCGCACGGCACTTCATCATTCTCACGAAGCC 
ACACTA AATTACCGACACATATATCCTGCGC TTGAATATGTTTCTTGATCCCCTTCCTAACAATCCAAACTTGCTTCTCATTCGCCTTCTTATC 
AGAC/UUVTATCTATTATAATCATCCCAAGCCTCTTTCGTCGCAGACACAGCAAATATAAAAATCAACGGCCCCCAAGTGCTCGCCGGATTCACC 
GGGGTAATAAGCGACCATAATTGAAGACACGCAATCAACAAGAAATACTGGTTCATAAACCGACTGAACTGCTCCATGAGATTTTTAGGGAGGA 
AATTGAGGAGAGTGTATTTCCGATTGGAAATGCGGTTGTCGCAGTAAAGATCATGTGTAGGCGATGACTCATCGTCGTTGATGTAGACGAACCG 
CTTCATTTTCTTTGATTGACTTTGATTGATTGATTTACATCATCAACTCCAGACGACTACGCATTCGTTACCGT 
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Exhibit 4 

Example of the isolation of a border-like sequence from potato DNA using the two-step PCR 
method described in Example 1 . 



Step 


Reference in 
USSN 
10/607,538 


Sequence 


1 


Paragraph 333 


1 St 1 1 „_ 

1 border 
primer 




Paragraph 333 


Anchor primer 


GACCACACCCGTCCTGTG 


Paragraph 335 


1 st amplified 
product 


CAGGATATATGTGAGGGTAAACAAATAACAACTAAACATGGA 

7\ t\ m t\ /»- » 7\ m t\ m TV f* m /— • t\ t\ t\ /-> t\ rp 7\ 7\ f^T TV 7\ A 7A TA T 1 T 1 f"* 7A 7\ T" A fZ T* 

AAT AG AT AT AC T G AAAC A I AAL 1 AAAAA 1 1 bj-fi/\^ l i 
T C AAT AAC C AAG T AC T AAT CAT G AAG AT AAC AT G C AG AA 
C T G AAT AC T G AAAT AAAT G T T G AAT AC AT G G T C AAT GC A 
t c a t cz a a t c t c a C T C T nnr^ A CC T AC T AT T AAC T GAC AT A 
AACCATGTGAGATAAACACGGAATCTGATGTATATTCCC 
T AT CGAGAGG AC C T AAT G T ACC T T G T C GG GG TAT AAAAT 
CTATGGCGTGATCCACAGGACGGGTGTGGTC 


Parapranh ^"36 


Expected 

border-like 
sequence 


CAGGATATATGTGAGGGTAAAC 


2 


Paragraph 337 


2 reverse 
primers 


CAGTATTCAGTTCTGCATGTTATC (1 st ) and 
GTACTTGGTTATTGAACCTAGTTC ( 2 nd ) 


2 direct 
primers 


CTGACTGTGGGAGCTACTATTAAC ( 1 st ) and 
GAT G TAT AT T C C C T AT C GAG AG GAC { 2 nd ) 


Restriction site 


ATGCAT (Nsil) 


Paragraph 337 


iPCR product 
(up to 
blunt- 
ended site) 


CTTGGTTATTGAACCTAGTTCAATTTTTAGTTATGTTTCAGT 
ATATCTATTTCCATGTTTAGTTGTTATTTGTTTAGCTTA 
GC AT AT AT CC T GC AT GC 


Actual border- 
like 

sequence 


GTTTAGCTTAGCATATATCCTGCAT 




Paragraph 339-346 


Activity as 
border 


Yes, with 89.7% of the activity of a 
conventional T-DNA border (reported 
4/24/04) 
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Exhibit 5 



riant 


sequence 


▼ ClllM 

pSIM 


V_ \J I t 


F!fficiencv 

IjlllClvllVJ 

% T-DNA 


Method** 




ANGATNTATNNNNNNGT 










A t 

At 


T G A P AGGATATATCGTGATGTC AAC 


564 


I 


64 


AL 13 8652 


ot 


ATP P AGGAT AT ATGC TAAGC T AAAC 


605 


H 


60 


PCR 


ot 


A P P P AHGATAT AT TTGCAGP T AGAA 


716 




0.4 


SGN-U 180201 


o* 
LM 


a p p p a cxz a t a t a t t t t natzG t a g G t 


1 j 1 


f 

X 


43 


SGN-U 179068 


ot 


PPPP AP^iAT AT ATTTGTTGGT A AT G 


SSI 


I 

X 


100 


PCR 


c+ 
bt 


A PGP AP^IATATATTTTGGGGT AAAC 


565 


I 


86 


PCR 


c+ 
ot 


rr; n p AfiGATATATAAAGTGT AAAA C 


566 


H 


6.6 


PCR 




P P P P A A T AT AT ACAAGT P T AAAC 




H 


17 


BI421743 




P PGP AGGAT AT ATT TTGGGGT AG GT 

^juu^-rlVjVjrVXrixrix x x xv7V7wV7xnvjo x 


732 


I 


25 


SGN U 15263 5 | 




APPPAnniATATATTTGC'AGPTAGAA 


71 0 


H 


.17 


SGN Ul 58307 




P P P P A PGA T AT AT TGAGAGGT AG AG 


560 


I 


102 


SGN Ul 58307 




PPPPAP-GATATATTCAGAGGTATGC i 


614 


I 


98.5 


SGN U160333 


T a. 


TT Z\r A P_P AT AT AT APAOTPiTT A APG 
1 l/iLnbunXAXnXA^nvalVjl x r"\x*"i.Vw 




H 


g 


OB683 




T P P P A PP: A T A T A T GGPPiAGGAA P AA 


720 

/ Xi V 


H 


2.4 


SGN-U155745 


T 


A P G P A CG A T AT AT CATCGGT AC T T — 

nvOvAVjunXAXnxunx^/\jvjinv/^ x 




H 


5.5 


AY0 13255 


T a 


rnCPttCG2iTATATTTTGGGGTAflGT 

buU^AVjV9inXAXAX X X XUVVTVJX AVJVJ X 


721 


I 


30 


SGN-U 153074 




t t z\ p Ann a t a t a tczc a aaat aa t T P 

1 Irt^^uuiAlAXAXVaunAAAinni x 


611 


H 


6.5 


SGN-U 160800 


T a. 


TC^rAGGATATATTTGATGfAATG- 


724 


H 


.9 


SGN-U 162952 


L»e 


T P A P AGGAT ATAT TAGACAGT T T C C 


725 


I 


35 


SGN-U 149980 




A A GP AGGATATATCTACATGGT ACC 


741 


H 


1.1 


AY0 13255 


T <a 


TPPPAPUxATATATTT— GATGCAATP 

1 Uuk/AV9UAX AX AX X X UAXVTwnni 




H 


.19 


AW031162 




PT APAPPATATATTGAGTTPTAAGA 
0 1 nLAuVsnlAXAX xutAWX x \^ x nrivjri 


550 


H 


1.7 


AW618843 




TPTPAPP.ATATATGTCAACPGGAAC 


551 


H 


0.1 


AW398191 




PPPPAPPATATATTTGTTGGTAATG 

vjvjv^AwAWiXAXAX X XVjX xvjuxnni w 


732 


I 


100 


SGN U 15263 5 


MS 


PP PP AGGAHGT AT APAf^AGGT A TAP 

LbuLAbWiUwXAXAUAVjAUwXn x 


844 


H 


168 


PCR 


Mt 


rpppa^lT'ACGTATAr'AGAGGTATAr 
k^loo LAuvnoVj XAXAWiuAw Xnl r\\^ 


S6X 


H 


32 


CA9 19279 


Ms 


rrrPlV^ATGTATArAGAGG'PfiTAr 

^UbLAbUAlVjXAXAUAIoAWJXni AU 




I 


56 


Modified 


HV 


rPrriVCCATGTATrATTTGGTATAT 
uubLAljbAXbXAXVrfAX X xu\7xn x ii x 


570 


I 


90 


TC137112 


IT,, 

HV 


t p p p a pp. a a p t a tp a ppjpPwPiT A G T G 

X u b L AouAnO XAXunVJVTwwwXrivj X \j 


571 


H 


0.3 




T T. . 

HV 


rraPACPATATATAAAGTGTAAAAr 
bbnbAbunlAXAXAAAVjXu liVVVi^ 


566 


H 


6.5 




t>n 


PT 1 APAPPATTTATTATAPAGTAGGG 

K^, 1 RLAuAjAI X XA X XAXAVrfAuxnuuu 


1064 


I 


2.6 


BnL58 


Bn 


T G AC AGGAC AT AT AGAACAGT T C C T 


1065 


H 


4.2 


BnL59 


Zm 


CCACAGGATATATAAATGCGTAAGC 


1063 


I 


9 


CF052264 


Ta 


GGACAGGATATATTGCTCCGTATAT 


928 


I 


78 


Ca500064 


Os 


AG C C AGGAT AT AT CC T T GAGT AAG T 


923 


I 


61 


AL772425 


Os 


ATGCAGGATATATTTTTTTGTAGTG 


924 


I 


51 


AP004161 


Os 


ACACAGGATATATATATACGTACAT 


925 


I 


44 


AP005498 



















































*: core Sequence = ANGATNTATNNNNNNGT (I = identical; H = homologous) 
**: method: Sequences isolated by using PCR-based methods are indicated 
with "PCR"; sequences identified in databases are shown with their 
accession number . 
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